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ABSTRACT 

L7Ae is a member of a protein family that binds kink-turns (k-turns) in many functional RNA species. We have solved the X-ray 
crystal structure of the near-consensus sequence Kt-7 of Haloarcula marismortui bound by Archaeoglobus fulgidus L7Ae at 
2.3-A resolution. We also present a structure of Kt-7 in the absence of bound protein at 2.2-A resolution. As a result, we can 
describe a general mode of recognition of k-turn structure by the L7Ae family proteins. The protein makes interactions in the 
widened major groove on the outer face of the k-turn. Two regions of the protein are involved. One is an a-helix that enters 
the major groove of the NC helix, making both nonspecific backbone interactions and specific interactions with the guanine 
nucleobases of the conserved G»A pairs. A hydrophobic loop makes close contact with the L1 and L2 bases, and a glutamate 
side chain hydrogen bonds with LI. Taken together, these interactions are highly selective for the structure of the k-turn and 
suggest how conformational selection of the folded k-turn occurs. 
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INTRODUCTION 

The L7Ae family of proteins (Koonin et al. 1994; Wafkins 
et al. 2000) include the ribosomal proteins L7Ae and L30e, 
yeast snu31p, human 15.5-kDa protein (Nottrott et al. 
1999), and bacterial homologs such as YbxF (Baird et al. 
2012). These proteins bind to kink-turns (k-turns) in duplex 
RNA, stabilizing their folded conformation. These interac- 
tions are found in most types of functional RNA. In the ribo- 
some, they mediate tertiary interactions and stabilize the 
structure. In box C/D snoRNPs, the binding of an L7Ae fam- 
ily protein to k-turn structures is the first step of assembly 
(Kuhn et al. 2002; Watkins et al. 2002) that then serves to re- 
cruit additional proteins to generate the functional apparatus 
for the fibrillarin-catalyzed site-specific methylation of RNA. 
L7Ae is also a critical component of the box H/ACA snoRNP 
(Rozhdestvensky et al. 2003) and U3 snoRNP (Marmier- 
Gourrier et al. 2003). In mammals, the 15.5-kDa protein 
binds to a k-turn in the U4 snRNA during the spliceosome 
cycle (Nottrott et al. 1999; Vidovic et al. 2000). L7Ae has re- 
cently been shown to be a sub unit of archaeal RNaseP (Cho 
et al. 2010) required for tRNA maturation. Thus, L7Ae-fam- 
ily proteins are rather general k-turn binding proteins that are 
important in ribosome structure, spliceosome assembly, and 
the site-specific modification of RNA. 
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The standard k-turn comprises a 3-nucleotide (nt) bulge 
followed on its 3' side by G*A and A*G pairs, and usually 
one further non- Watson-Crick pair (Fig. 1A). Although sub- 
stitutions can be tolerated in most positions in varying de- 
grees in some sequence environments, k-turns that are 
close to the consensus sequence appear to adopt the most sta- 
ble folded conformation. Kt-7 of the Haloarcula marismortui 
50S ribosomal subunit 23S rRNA is very close to being a con- 
sensus k-turn, and its interaction with Archaeoglobus fulgidus 
L7Ae protein has been extensively studied. It was shown that 
binding of L7Ae strongly shifted the conformational popula- 
tion to the folded state (Turner et al. 2005), with an apparent 
binding affinity of 10 pM (Turner and Lilley 2008). Recently 
in single-molecule experiments, we detected no folding inter- 
mediates down to a timescale of 8 msec, consistent with a 
conformational selection mechanism (Wang et al. 2012). A 
crystal structure at 2.7-A resolution was determined for 
L7Ae bound to the box C/D k-turn (Moore et al. 2004) 
and for L7Ae bound to Kt-15 in the ribosome (Ban et al. 
2000). However, Kt-15 is a complex k-turn whose structure 
does not map linearly onto the sequence. 

Kt-7 exhibits an interesting structural plasticity in terms 
of the hydrogen bonding in the core (Daldrop and Lilley 
2013). Like most k-turns, the folded structure of Kt-7 is sta- 
bilized by two near-universal cross-strand hydrogen bonds, 
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FIGURE 1. Sequence of Kt-7 and the RNA species used for crystalliza- 
tion. (A) The sequence of H. marismortui Kt-7, with key nucleotides la- 
beled using our standard nomenclature (Liu and Lilley 2007). (B) A 
stem-loop RNA containing a single Kt-7 sequence. This was used for 
the crystallization of the complex of A. fulgidus L7Ae bound to Kt-7. 
(C) A duplex RNA containing two Kt-7 sequences with twofold rota- 
tional symmetry. This was crystallized in the absence of protein. 



accepted by the nucleobases of the two conserved adenines. 
LI 02' donates its proton to Nl of the Aln, whereas the 
-In 02' donates its proton to either N3 or Nl of A2b. 
Almost all the known k-turn structures divide into one of 
two classes, depending on whether the acceptor is N3 or 
Nl. We have previously shown that when the Kt-7 sequence 
is inserted into the SAM-I riboswitch, it is a member of the 
N3 class (Daldrop and Lilley 2013); 
whereas in the ribosome, it is an Nl class 
structure. This conformational switch re- 
sults in a significant change of the rota- 
tional setting of the NC helix of the k- 
turn, with consequences for any tertiary 
interaction made by that helix. No atom- 
ic resolution structure has been deter- 
mined to date for Kt-7 bound to a L7Ae 
family protein, so its structural class un- 
der these conditions is unknown. Nor is 
the structure of Kt-7 known in the ab- 
sence of bound proteins or tertiary 
constraints. 

We therefore decided to determine 
the crystal structure of the A. fulgidus 
L7Ae protein bound to the archetypal 
k-turn Kt-7 from H. marismortui, given 
the wealth of biophysical data that has 
been collected for this interaction. We 
have succeeded in obtaining a crystal 
structure of the complex at 2.3 A. This 
is the highest resolution obtained to 
date for the complex of L7Ae with a 
standard k-turn, with excellent elec- 
tron density that well defines the protein 
side chains in their interaction with the 
k-turn RNA. From this we present a ge- 
neral model for the recognition of k- 
turn structure by the L7Ae family of 
proteins. 



RESULTS 

Crystallization of a complex of Kt-7 with L7Ae 

Well-diffracting crystals were obtained from an equimolar 
mixture of L7Ae with a 26-nt RNA oligonucleotide, where 
the C helix was closed by a GAAA tetraloop (Fig. IB). The 
structure was determined by molecular replacement and 
subsequently refined to 2.3-A resolution. In addition, we 
also succeeded in obtaining crystals of a 19-nt RNA contain- 
ing the Kt-7 sequence in the absence of protein. The RNA 
formed a self-complementary duplex with two k- turns related 
by twofold symmetry (Fig. 1C). The structure was deter- 
mined by molecular replacement and refined to 2.2-A reso- 
lution. Crystallographic statistics are presented in Table 1. 

The interaction of L7Ae with Kt-7 

L7Ae adopts its standard fold in the complex (Fig. 2), com- 
prising a four-strand mixed p-sheet flanked on both sides 
by a-helical sections. In overall terms, the structure is similar 
to that of the complex with the box C/D k-turn (Moore et al. 
2004), with an RMSD of 0.46 A. The protein binds in the 
major groove on the outer face of the Kt-7 k-turn, and 
two sections form the principle contacts with the RNA. 



TABLE 1. Details of data collection and refinement statistics for the crystallographic data as 


deposited in the PDB 






Details of data collection 


Macromolecules 


L7Ae with Kt-7 


Kt-7 


PDB code 


4BW0 


4C40 


Space group 


P4 3 2,2 


P6 3 22 


Unit cell dimensions (A) 


a = 61 .92 


a = 70.25 




6 = 61.92 


6 = 70.25 




c = 130.97 


c = 47.70 




a=P = Y = 90.00° 


« = p = 90.00°, 7 = 120.00° 


Resolution range (A) 


61.92-2.33 (2.41-2.33) 


37.54-2.20 (2.28-2.20) 


Observations 


73857 


39137 


Unique observations 


11309 (1031) 


3827 (371) 


Completeness (%) 


98.23 (92.38) 


99.92 (100.00) 


<l>/<o(l)> 


20.15 (2.93) 


15.60 (3.17) 


Emerge (%) 


5.6 (43.2) 


9.2 (76.0) 


Multiplicity 


6.5 (3.4) 


10.2 (10.6) 


Refinement statistics 


Resolution range (A) 


55.98-2.33 (2.41-2.33) 


37.54-2.20 (2.28-2.20) 


R-factor 


0.1804 (0.2489) 


0.2227 (0.4108) 


R-free 


0.2009 (0.3030) 


0.2640 (0.4385) 


Number of atoms 3 


1479/5/53 


417/0/14 


Mean B-factor b (A 2 ) 


39.70/33.80 


47.70/48.90 


RMS bond length deviation (A) 


0.007 


0.005 


RMS bond angle deviation (°) 


1.06 


1.00 


Ramachandran favored (%) 


98 




Ramachandran outliers (%) 


0 




Error in coordinates by Luzzati plot 


0.3030 


0.4221 


a Number of atoms for macromolecules, ligand, and water molecules, respectively. 


Mean B-factors for macromolecules and solvent molecules, res 


pectively. 
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FIGURE 2. The structure of the complex of L7Ae bound to Kt-7. 
Stereoscopic pairs (A,D,E) Are shown as parallel-eye stereoscopic pairs. 
Electron density is taken from the composite omit map and contoured 
at 2a. (A) An overall view of the complex. The basic (3-strand:turn:helix 
and hydrophobic loop sections are colored blue, with hydrogen bonds 
between amino acid side chains and the RNA highlighted in red. The 
nucleotides of the k-turn are colored in our conventional scheme 
(loop purple; G'A pairs green; NC helix gray; C helix yellow), with 
the NC and C helices indicated. (B,C) Nonstereoscopic images of the 
structures of the two recognition elements of the protein, i.e., the a-helix 
(B) and the hydrophobic loop (C). (D,E) The interaction between the a- 
helix (D) and the hydrophobic loop (£) of the k-turn RNA. The electron 
density map is shown for the protein. Hydrogen bonds between amino 
acid side chains and the RNA are highlighted in red. Note that the N-ter- 
minal end of the a-helix is directed toward 06 of Gib. 

The definition of the electron density of the new structure is 
excellent, and we can position most amino acid side chains 
with good confidence and thus identify important contacts 
with the k-turn. To exclude the possibility of model bias, 



we have calculated a composite omit map of the structure, 
and all the electron density shown is taken from that. 

One major set of contacts is made by the second a-helix 
from the N terminus, comprising residues 32-41, together 
with the preceding short (3-strand (residues 27-30) that is 
part of the four-strand (3-sheet (Fig. 2B,D). The helix is direct- 
ed into the major groove of the NC helix, with its N-terminal 
end directed toward the guanine nucleobases of the two G*A 
pairs of the k-turn. The p-strand:turn:helix element is highly 
basic, and side chains make a number of contacts with the 
RNA. At the C-terminal end of the a-helix, basic side chains 
from successive turns are hydrogen bonded to backbone phos- 
phate proS nonbridging oxygen atoms on the nonbulged 
strand of the NC helix. Arginine R41 is bonded to the 3n/4n 
phosphate (N-O distance = 3.0 A) and lysine K37 to the 4n/ 
5n phosphate (N-0 distance = 2.7 A). The side chains of ly- 
sine residues K27 and K29 are directed toward the backbone 
of the same nonbulged strand; these are too distant to form hy- 
drogen bonds but could contribute to the electrostatic stabili- 
zation of the complex. At the N-terminal end of the a-helix, 
two successive amino acid side chains make specific hydro- 
gen-bonded contacts to the Gib and G2n nucleobases. 
Glutamate E34 accepts a proton from Nl of Gib (N-0 dis- 
tance = 3.0 A), and glutamine N33 donates a proton to 
06 of G2n (N-O distance = 2.6 A). These two interactions 
specifically recognize the position of the conserved guanine 
nucleobases of the k-turn in the major groove. Additionally, 
the positive pole of the helix dipole is directed toward the 
Gib 06 atom, providing significant electrostatic stabilization. 

The other major protein-RNA contact occurs at the loop 
comprising residues 86-93 (Fig. 2C,E). This has the sequence 
VGIEVPC. This very hydrophobic loop makes close van der 
Waals contact with the L2 and LI nucleobases of the k-turn 
loop. The hydrocarbon side chains of isoleucine 188 and va- 
line V90 are on the lower face, with 188 directly facing the L2 
nucleobase in its syn conformation. Altogether the contact 
buries 732 A 2 . The single charged amino acid of the loop, glu- 
tamate E89 is hydrogen bonded to Nl of GL1 (N-O distance 
= 2.9 A); the position of this side chain is very well defined by 
the electron density. 

The structure of Kt-7 in the complex 

Within the complex with L7Ae, Kt-7 adopts a standard 
N3-class k-turn structure (Fig. 3A,B). The C helix is fully 
base-paired, and the loop nucleotides adopt their normal 
positions, i.e., the GL1 and AL2 nucleobases are stacked 
onto the C and NC helices, respectively, and AL3 is directed 
away from the RNA. AL2 has a syn glycosyl bond conforma- 
tion, positioning it to stack on Aln. The G*A pairs at the lb* 
In and 2n-2b positions are conventional trans G (sugar)-A 
(Hoogsteen) pairs, followed by another in the 3n - 3b position, 
after which Watson- Crick pairing resumes. Kt-7 has the stan- 
dard cross-strand A-minor hydrogen bonds (Liu and Lilley 
2007; Daldrop and Lilley 2013), from LI 02' to Aln Nl 
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FIGURE 3. The structure of Kt-7 in the complex with L7Ae and in the 
absence of bound protein. These are shown as parallel-eye stereoscopic 
pairs. (A) The complete k-turn structure, viewed from the side of the 
bulged strand, with the electron density shown contoured at 2a is taken 
from the composite omit map. (B,C) The structures of Kt-7 in the com- 
plex (B) and protein-free (C), both viewed from the nonbulged strand 
side with the C-helix directed rightward. 

(O-N = 2.6 A) and from G-ln 02' to A2b N3 (O-N = 2.7 A). 
Thus, the structure falls into the N3 class of k-turn structures 
(Daldrop and Lilley 2013). The near-universal hydrogen 
bond from AL3 02' to P1/P2 O proS (0-0 = 2.3 A) closes 
the loop of the k-turn. All the hydrogen bonds identified 
have acceptable angular geometry. 

The structure of Kt-7 in isolation 

We have also solved the structure of Kt-7 as a duplex RNA in 
the absence of bound L7Ae, at a resolution of 2.2 A (Fig. 3C). 
The structure of the k-turn is closely similar to that in the 
L7Ae-bound complex. The two structures superimpose 
with an RMSD of 0.83 A (Fig. 4). All the standard hydrogen 
bonds are present, including the cross-strand A-minor inter- 
actions (Liu and Lilley 2007; Daldrop and Lilley 2013) 
and the hydrogen bond from AL3 02' to P1/P2 O proS 
(0-0 = 2.6 A). G-ln 02' donates its proton to A2b N3 
(O-N = 2.5 A), and thus the structure is a member of the 
N3 class of k-turn structures (Daldrop and Lilley 2013). 

Variability of adenine 2n base-pairing and its A-minor 
interaction with -1 n 02' 

The structure of Kt-7 falls in the N3 class in both the complex 
with L7Ae and as a protein-free duplex. The angular position 



of the A2b nucleobase thus permits both hydrogen bonds to 
form with G2n, with a normal bond from A2b N6 to G2n N3 
(N-N = 3.1 A for the complex and 3.0 A for the isolated k- 
turn). The consequent rotational setting of the helical axis re- 
sults in a long distance between A3b 02' and C-2n 02' (0-0 
= 6.8 and 6.9 A for the complex and free k-turn, respectively). 
The structures of the G2n-A2b-G-ln interactions are com- 
pared with the corresponding regions of Kt-7 in the SAM- 1 
riboswitch (Daldrop and Lilley 2013) and the ribosome 
(Ban et al. 2000) in Figure 5. Kt-7 adopts the N3 structure 
in the L7Ae complex, free duplex, and the SAM-I riboswitch, 
whereas in the ribosome, it forms the Nl structure, with a 
consequent loss of the A2b N6 to G2n N3 hydrogen bond. 
The structure of the free Kt-7 in particular suggests that the 
N3 structure is intrinsically more stable than the Nl struc- 
ture, perhaps because of a more stable G2n*A2b pair. In 
the ribosome Kt-7 is bound by L24, and the terminal loop 
of the C helix makes a loop-loop interaction with the adja- 
cent stem-loop, connected via a three-way helical junction. 
Given the change in axial rotation between the N3 and Nl 
structures (Daldrop and Lilley 2013) (which is also the reason 
for the marked lengthening of the A3b-C-2n 02' to 02' dis- 
tance), it is probable that the orientational requirements of 
the tertiary interactions stabilize the Nl conformation in 
the ribosome. This is a further demonstration of the confor- 
mational plasticity of the k-turn and its ability to accommo- 
date the structural requirements of its environment. 

DISCUSSION 

We have presented a structure of L7Ae bound to a standard k- 
turn at the highest resolution available to date, such that the 




FIGURE 4. Superposition of crystal structures of Kt-7. The structure of 
Kt-7 from the complex with L7Ae (PDB:4BW0) is shown in our stan- 
dard coloring. The structures are viewed from the side of the bulged 
strand. The images are shown in parallel-eye stereo. (A) Superimposi- 
tion with the structure of Kt-7 as a protein-free duplex colored cyan 
(PDB:4C40). The RMSD between the two structures is 0.83 A. (B) 
Superimposition with the structure of Kt-7 inserted into the SAM-I 
riboswitch (Daldrop and Lilley 2013) colored orange (PDB:4B5R). 
The RMSD between the two structures is 1.13 A. Note that all three 
Kt-7 structures are in the N3 conformation. 
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in SAM-I riboswitch 



FIGURE 5. Comparison of the G2n*A2b-A-ln interaction in four Kt-7 structures. (A) Bound by 
L7Ae protein (this work, with electron density from the composite omit map); (B) as free RNA 
(this work, with electron density from the composite omit map); (C) engineered into the SAM-I 
riboswitch (Daldrop and Lilley 2013); and (D) in the H. marismortui ribosomal 50S subunit 23S 
rRNA (Ban et al. 2000). Structures A-C fall into the N3 class, with G-ln 02' donating a proton to 
A2b N3 and two hydrogen bonds connecting A2b and G2n, whereas the ribosomal structure (D) 
is Nl class where G-ln 02' donates a proton to A2b Nl. The resulting rotation of the A2b nucle- 
obase results in an A2b N6 to G2n N3 N-N length of 4.3 A, i.e., too long to be hydrogen bonded. 



electron density for all the protein side chains can be ob- 
served with confidence. In light of this and comparison 
with other related structures available (Vidovic et al. 2000; 
Moore et al. 2004), we can describe the molecular recogni- 
tion of a k-turn by the L7Ae-related proteins in general terms. 

• These proteins bind exclusively in the major groove of the 
k-turn. This would not be possible in regular duplex RNA 
due to the deep, narrow characteristics of the major groove 
in double-stranded RNA. However the structure of the k- 
turn splays open the groove, forming a continuous rib- 
bon-like surface around the outer face of the structure, 
where it is very accessible. 

• The binding interface of L7Ae-related proteins comprises 
two elements; first is a highly basic (3-strand:turn:a-helix el- 
ement; and second, a short loop of hydrophobic residues 
(plus a glutamate at one end) that makes a 180° turn. 

• The three-turn a-helix enters the major groove of the NC 
helix so as to direct its N-terminal end toward the major 
groove edges of the guanine nucleobases of the G'A pairs. 
This resembles the entry of the recognition helices of he- 
lix-turn-helix elements interacting with DNA. 

• The C terminus of the a-helix lies close to the nonbulge 
strand of the NC RNA helix and contributes nonspecific in- 
teractions to the binding. The arginine in the third turn 
(R41 in A. fulgidus L7Ae) hydrogen bonds to the 3n/4n 
phosphate. This interaction is near-universal, proposed in 



the complexes of the same species L7Ae 
with box C/D and the human 15.5-kDa 
protein with U4 snRNA. The lysine at 
the second turn (K37 in A. fulgidus 
L7Ae) is more variable in its interactions 
but is bonded to the 4n/5n phosphate 
in the Kt-7 complex. Two of the basic 
side chains at the N-terminal end of the 
(3-sheet section are also directed toward 
the NC helix nonbulge strand, not close 
enough for hydrogen bonding, but these 
will contribute to nonspecific electro- 
static stabilization. 

The N-terminal end of the a-helix is in- 
volved in specific recognition of the core 
of the k-turn. The glutamate side chain 
from the first turn (E34 in A. fulgidus 
L7Ae) is hydrogen bonded to Nl of 
Gib. This interaction is also observed 
in the interactions of L7Ae with box 
C/D and the 15.5-kDa protein with U4 
snRNA and is likely to be universal. The 
adjacent asparagine residue (N33 in A. 
fulgidus L7Ae) is hydrogen bonded to 
06 of G2n, also observed in the box CI 
D interaction. Interestingly, in a similar 
complex of L7Ae with Telohenia solenop- 
sae Kt-23, where the 2n position is ade- 
nine (not guanine as in a standard k-turn), N33 is 
hydrogen bonded to N6 (L Huang and DMJ Lilley, unpubl.) . 
In the L7Ae-Kt-7 complex, as well as the other structures 
considered here, the 06 atom of Gib is located almost on 
the axis of the a-helix at the N-terminal end. Guanine 06 
has a significant partial negative charge, and this will be elec- 
trostatically stabilized by the positive pole of the helix dipole. 
This appears to be another universal interaction in the 
L7Ae-k-turn complexes. Lastly, the C-terminal lysine of 
the (3-strand is directed toward N7 of the guanine at the L2 
position. However, this is too long to be hydrogen bonded 
in the Kt-7 complex, and a similar situation probably exists 
in the box C/D and 15.5-U4 snRNA complexes. 
The hydrophobic loop covers the nucleobases in the L2 and 
LI position, making good van der Waals contact and bury- 
ing a surface area of 732 A 2 . The syn conformation of L2 
places it maximally in contact with the loop. Isoleucine 
188 in particular is located direcdy over GL2 in the Kt-7 
complex. A very similar interaction is observed in the box 
C/D and 15.5-U4 snRNA complexes. The glutamate of 
the loop (E89 in A. fulgidus L7Ae) is hydrogen bonded to 
Nl of the LI guanine, thus making a specific contact. 
This is also observed in the box C/D complex, but is not 
found in the complex between the 15.5-kDa protein with 
U4 snRNA, where the LI nucleotide is cytosine. However, 
the glutamate side chain is probably flexible enough to 
make contact with bases other than guanine. 
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• The combined effect of these elements is a highly specific 
molecular recognition of the structure of the k-turn in dou- 
ble-stranded RNA. By placing the a-helix in the major 
groove of the NC arm and capping the L2 nucleobase it ef- 
fectively "measures" the angle between these two elements 
in the major groove. This then juxtaposes the N terminus of 
the a-helix with the major groove side of the guanine bases 
of the two conserved G - A, where side chains make specific 
hydrogen bonds, and Gib 06 is placed to maximize the in- 
teraction with the positive pole of the helix dipole. 

The L7Ae proteins make no significant interaction with the C 
helix. Thus, these proteins should recognize a k-loop struc- 
ture (where the C helix is replaced by a terminal loop) equally 
well as a k-turn. For example, the L7Ae-family proteins bind 
the k-loops of the box H/ACA RNA in the assembly of the 
snoRNP required for pseudouridylation (Rozhdestvensky 
et al. 2003; Hamma and Ferre-D'Amare 2004; Li and Ye 
2006; Youssef et al. 2007). 

The folding of Kt-7 that occurs on the binding of L7Ae 
might in principle occur by conformational selection, or al- 
ternatively by a more active induced fit process (Fig. 6). We 
have studied the binding of Kt-7 to L7Ae in single molecule 
experiments and failed to detect any intermediate folded state 
down to a timescale of 8 msec (Wang et al. 2012). This is con- 
sistent with a conformational selection process, although we 
cannot exclude a protein-induced conformational change in 
the RNA on a faster timescale. However, a prerequisite of the 
conformational selection model is that the folded k-turn is in 
equilibrium with the unfolded RNA (as shown previously by 




FIGURE 6. Scheme showing two alternative pathways of k-turn folding 
upon binding to L7Ae protein. The Kt-7 RNA is in equilibrium between 
the extended and folded conformation in free solution, biased toward 
the extended state at low ionic concentration. In the conformational se- 
lection mechanism (the vertical equilibrium shown right), the L7Ae se- 
lectively binds to the folded form and thus draws the equilibrium toward 
this state. Alternatively, the protein could bind the extended state and 
mechanically force the RNA to adopt the k-turn structure in an induced 
fit process {left). The conformational selection model requires that L7Ae 
structure is complementary to that of the folded k-turn, and the struc- 
ture of the k-turn in the absence of protein is closely similar to that of the 
bound RNA. The structures determined here (shown far right) show 
that these requirements are met. 



time-resolved FRET experiments), and the conformation of 
the folded RNA in the free and protein-bound states are 
closely similar. Comparison of the two structures presented 
here show that this is the case with an all-atom RMSD of 
0.83 A. Thus, no conformational adjustment of the RNA 
structure should be required on binding to L7Ae. 

YbxF is a bacterial homolog of L7Ae protein (Sojka et al. 
2007). Interestingly, this protein binds k- turns but not k- 
loops. A high-resolution structure of the complex with the 
SAM-I riboswitch k-turn has been solved by Ferre 
D'Amare and coworkers (Baird et al. 2012). The structure 
of YbxF is similar to L7Ae; and the general orientation of 
the complex is similar, with the a-helix (although not preced- 
ed by a basic (3-strand) entering the NC helix major groove 
and directed toward 06 of Gib and the hydrophobic loop 
stacked over L2 and LI. Yet there is a striking difference in 
that none of the interactions with the side chains of the a-he- 
lix is observed. It may be relevant that the binding affinity of 
YbxF to the k-turn (K d = 270 nM) (Baird et al. 2012) is orders 
of magnitude weaker than that of L7Ae to Kt-7 (K d = 10 pM) 
(Turner and Lilley 2008). 

In the ribosome, k-turns are recognized by a number of 
different proteins that selectively bind to their cognate tar- 
gets. We have also found that S17 and L24 stabilize the kinked 
structure of their cognate k-turns on binding in a similar 
manner to L7Ae (L Huang and DMJ Lilley, unpubl.). The 
structures of these proteins have little in common, and they 
bind to different faces of the k-turn. For example, in the small 
ribosomal subunit, the binding domain of SI 7 comprises 
largely (3-sheet that binds to the C helix of Kt-1 1. The protein 
makes no contact with the lb or 2n nucleobases, but a long a- 
helix projects over the loop region with side chains stacked on 
the loop nucleobases. Sll places an antiparallel (3-sheet into 
the splayed out minor groove of the NC helix. Evidently, 
the k-turn presents a number of distinctive structural features 
that can be recognized by a variety of different binding 
proteins. 

However, L7Ae is the most extensive family of k-turn 
binding proteins. Binding of L7Ae family proteins to k-turns 
and k-loops is required in many aspects of RNA function in 
the cell, including translation (L7Ae and L30e in the ribo- 
some), site-specific RNA modification (L7Ae binding a k- 
turn in the formation of box C/D and H/ACA complexes), 
and spliceosome assembly (15.5-kDa protein binding U4 
snRNA). These interactions are therefore very important in 
many aspects of RNA function. 

MATERIALS AND METHODS 
RNA synthesis 

Ribooligonucleotides were synthesized using t-BDMS phosphora- 
midite chemistry (Beaucage and Caruthers 1981), as described in 
Wilson et al. (2001). Oligoribonucleotides were deprotected in 
25% ethanol/ammonia solution for 3 h at 20°C and evaporated to 
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dryness. Oligoribonucleotides were redissolved in 100 uL DMSO to 
which was added 125 uL 1 M triethylamine trihydrofluoride (TEA, 
3HF) (Sigma-Aldrich) and incubated for 2.5 h at 65°C to remove t- 
BDMS protecting groups. All oligonucleotides were purified by gel 
electrophoresis in polyacrylamide in the presence of 7 M urea. 
The full-length RNA product was visualized by UV shadowing. 
The band was excised and electroeluted using an Elutrap (What- 
man) into 45 mM Tris.borate (pH 8.5), 5 mM EDTA buffer for 8 
h at 200 V at 4°C. The RNA was precipitated with ethanol, washed 
once with 70% ethanol, and suspended in water. The concentration 
of RNA was determined by measuring the absorbance at 260 nm, 
using an extinction coefficient of 366.1 mM _1 cm _1 and 264.3 
mJVr'cm" 1 . 

Expression and purification of A. fulgidus L7Ae 

A gene encoding full-length A. fulgidus L7Ae was cloned into a mod- 
ified pET-Duetl plasmid (Novagen) (Huang et al. 2011) using the 
Hindlll and £eoRI sites. The L7Ae gene was fused upstream of a 
hexahistidine-encoding sequence with a PreScission-cleavable link- 
er. The hexahistidine-L7Ae fusion protein was expressed in Escher- 
ichia coli BL21-Gold (DE3) pLysS cells (Stratagene) induced with 0.2 
mM IPTG for 12 h at 20°C. Harvested cells were resuspended in 20 
mM Tris.HCl, pH (8.0), 500 mM NaCl, 10 mM imidazole, 1 mM 
phenylmethylsulfonyl fluoride (buffer A) and lysed by sonication. 
The protein suspension was heated for 20 min at 85°C in the pres- 
ence of 10 mM MgCl 2 to denature endogenous protein, and this was 
removed by centrifugation at 18,000 rpm for 30 min at 4°C. L7Ae 
was loaded onto a HisTrap column (GE Healthcare), washed with 
25 mM imidazole in buffer A, and the protein was eluted with 
500 mM imidazole in buffer A. The His 6 tag was cleaved from the 
L7Ae fragments by PreScission protease in 20 mM Hepes-Na (pH 
7.6), 100 mM NaCl, 0.5 mM EDTA for 16 h at 4-8°C. L7Ae was 
loaded onto a heparin column (GE Healthcare) and eluted at 250 
mM in a 50-2000 mM gradient of NaCl in 20 mM Hepes.Na (pH 
7.6). The protein was further purified using a Superdex 200 gel fil- 
tration column in a buffer containing 5 mM Tris.HCl (pH 8.0), 100 
mM NaCl. The protein concentration was measured by absorbance 
at 280 nm using a molar extinction coefficient of 5240 M _1 cm" 1 for 
L7Ae. This value was calculated on the basis of amino acid compo- 
sition. The protein was concentrated to 20 mg/mL in buffer contain- 
ing 5 mM Tris.HCl (pH 8.0), 100 mM NaCl, and stored at -20°C as 
aliquots. 

Crystallization, structure determination, 
and refinement 

Kt-7 complexed with L7Ae 

A number of RNA constructs containing the H. marismortui Kt-7 
sequence were used in crystallization trials of the L7Ae-HmKt7 
complex. Two main designs were investigated: One comprised a 
set of 25-28 nt RNA oligonucleotides, each generating a k-turn 
where the C helix was closed by a GAAA tetraloop (Fig. IB); and 
the other comprised 17-25 nt RNA oligonucleotides, containing 
two self-complementary copies of the Kt-7 sequence related by a 
twofold axis (Fig. 1C). 

A mixture of 0.25 mM RNA and 0.25 mM L7Ae in 5 mM Tris. 
HC1 (pH 8.0) and 100 mM NaCl was heated for 1 min to 85°C. 



The solution was slowly cooled to 20°C; MgCl 2 was added to a final 
concentration of 10 mM. Crystals were grown by vapor diffusion us- 
ing drops prepared by mixing 1.0 uL of the RNA-protein complex 
with 1 pL of a reservoir solution comprising 100 mM Hepes.Na 
(pH 7.0), 2.3 M ammonium sulfate at 20°C. Crystals (approximate 
dimensions 500 x 50 x 50 um 3 ) of space group P4 3 2!2 appeared af- 
ter 3 d. Crystals were transferred to a solution containing 100 mM 
Hepes.Na (pH 7.0), 2.0 M ammonium sulfate, and 30% glycerol 
for ~5 sec. The crystals were flash frozen by mounting in nylon 
loops and plunging into liquid nitrogen. The crystals were character- 
ized in the laboratory with a MicroMax HF007 copper rotating an- 
ode X-ray generator equipped with an ACTOR sample changer 
system and a Saturn 944HG+ CCD detector (Rigaku). Suitable crys- 
tals were stored and subsequently used to measure full data sets on 
beamline 103 of the Diamond Light Source (Harwell, UK). The crys- 
tals had unit cell dimensions a = 61.9 A, b = 61.9 A, and c = 131.0 A. 
From crystal density considerations (Mathews 1968), one RNA- 
protein complex was expected to be present in the asymmetric unit. 

The structure was determined by molecular replacement. In the 
first stage, the A. fulgidus 7Ae structure taken from PDB file 1RLG 
was used as the search model, followed by the H. marismortui 
Kt-7 structure taken from PDB file 4B5R, using the program 
PHASER (McCoy et al. 2007). The resulting electron density map 
revealed the remaining GAAA tetraloop and the N-terminal region 
of L7Ae, and a model was built using the difference map. Composite 
omit maps were calculated using PHENIX (Adams et al. 2010). 

Kt-7 as free RNA 

Three types of design were explored in attempts to crystallize Kt-7 
RNA in the absence of protein. These were (1) the same 25-28 nt 
RNA with a GAAA tetraloop closing the C-helix as above (Fig. 
IB); (2) 30-36 nt RNA in which the C-helix was a stem-loop closed 
by a U1A protein binding loop (data not shown); and (3) the 17-25 
nt RNA oligonucleotides with two inverted copies of the Kt-7 de- 
scribed above (Fig. 1C). 

A solution of 1 mM RNA in 5 mM Tris.HCl (pH 8.0) and 100 
mM NaCl was heated for 1 min to 95°C. The solution was slow 
cooled to 20°C; MgCl 2 was added to a final concentration of 10 
mM. The hanging-drop vapor diffusion method was used for crys- 
tallization. A volume of 1 .0 uL of prefolded RNA was mixed 1 : 1 with 
well solution comprising 3.5 M Na formate, 0.1 M Na acetate 
(pH 4.0) at 20°C. Crystals (approximate dimensions 100 x 10 x 10 
um 3 ) with space group P6 3 22 appeared after 5 d. Crystals were 
briefly washed in well solution supplemented with 30% glycerol. 
The crystals were flash frozen by mounting in nylon loops and 
plunging into liquid nitrogen. A 2.45 A resolution data set was col- 
lected on a tunable microfocus beamline 124 of the Diamond Light 
Source (Harwell, UK). Later, a bigger crystal (approximate dimen- 
sions 200 x 30 x 30 um 3 ) was obtained from which a 2.2-A resolu- 
tion data set was collected on beamline ID23-1 at the European 
Synchrotron Radiation Facility (Grenoble, France). The crystals 
had unit cell dimensions a = 70.2 A, b = 70.2 A, and c = 47.7 A. 
The molecular weight of a subunit is 6.3 kDa, and the asymmetric 
unit consists of one subunit with a VM value of 2.7 A 3 Da -1 and sol- 
vent content of ~69% (Mathews 1968). The structure was deter- 
mined by molecular replacement. H. marismortui Kt-7 (PDB 
4bw0) was used as the search model using the program PHASER 
(McCoy et al. 2007). The remaining nucleotides were added to the 
model based on inspection of electron density difference maps. 
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Both structural models were built in Coot (Emsley et al. 2010) 
and refined with Refmac5 (Murshudov et al. 2011) from the 
CCP4 suite of programs (Winn et al. 2011) and Phenix (Adams 
et al. 2010). Ramachandran analysis shows that 100% of amino 
acid residues are in the most favored and additionally allowed re- 
gions. Model geometry and the fit to electron-density maps were 
monitored with MOLPROBITY (Chen et al. 2010) and the valida- 
tion tools in COOT. 



DATA DEPOSITION 

Atomic coordinates and structure factor amplitudes have been 
deposited with the PDB with accession codes 4BW0 and 4C40. 
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